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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Bujard, Hermann 

Gossen, Manfred 



(ii) TITLE OF INVENTION: Methods for Regulating Gene Expression 



(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 State Street, suite 510 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 
20 (F) ZIP: 02109-1875 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWAPE: ASCII Text 

(vi) CTJRRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 
30 (B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/383,754 
35 (B) FILING DATE: 03 -FEB- 1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

<A) APPLICATION NUMBER: US 08/275,876 
40 (B) FILING DATE: 15 -JULY- 94 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/270,637 
45 (B) FILING DATE: 01- JULY- 94 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/260,452 
50 (B) FILING DATE: 14 -JUNE- 94 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/076,327 
55 (B) FILING DATE: 14-JUNE-93 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/076,726 
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(B) FUSING DATE: 14 -JUNE- 93 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. Jr. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE/DOCKET NUMBER: BBI-009CP6 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1..1008 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..1008 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 1..207 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 208.. 335 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1005 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 



CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 



AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 
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CGG <3CT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 
Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Ph6 Leu Arg 
^5 70 75 80 

AAT AAG GOT AAA AGT TTT AGA TGT GOT TTA OTA AGT CAT CGC AAT GGA 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 



GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
15 100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

20 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

25 GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
30 Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
35 180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC GCG 
lie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

40 

TAG AGC CGC GCG CGT ACG AAA AAC AAT TAC GGG TCT ACC ATC GAG GGC 
Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr He Glu Gly 
210 215 220 

45 CTG CTC GAT CTC CCG GAC GAC GAC GCC CCC GAA GAG GCG GGG CTG GCG 
Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 

GCT CCG CGC CTG TCC TTT CTC CCC GCG GGA CAC ACG CGC AGA CTG TCG 
50 Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

ACG GCC CCC CCG ACC GAT GTC AGC CTG GGG GAC GAG CTC CAC TTA GAC 
Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
55 260 265 270 

GGC GAG GAC GTG GCG ATG GCG CAT GCC GAC GCG CTA GAC GAT TTC GAT 
Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 



-78- 



CTG GAC ATG TTG GGG GAC GGG GAT TCC CCG GGT CCG GGA TTT ACC CCC 912 
Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 



CAC GAC TCC GCC CCC TAC GGC GCT CTG GAT ATG GCC GAC TTC GAG TTT 960 
His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
305 310 



315 



320 



10 GAG GAG ATG TTT ACC GAT CCC CTT GGA ATT GAC GAG TAC GGT GGG TAG 1008 
Glu Gin Met Phe Thr Asp Pro Leu Gly lie Asp Glu Tyr Gly Gly 
325 330 335 

(2) INFORMATION FOR SEQ ID N0:2: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
1 5 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
25 20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

30 Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
70 75 



35 



80 



40 



Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

45 Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Glv Cvs 
130 135 140 ^ ^ 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
50 ISO 1S5 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 
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ile lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr lie Glu Glv 
210 215 220 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
230 235 240 

Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asd 
265 270 

Gly Glu Asp val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 

Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
310 315 320 

Glu Gin Met Phe Thr Asp Pro Leu Gly He Asp Glu Tyr Gly Gly 
325 330 335 



INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA 

(xi) .SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GAC GCG CTA GAC GAT TTC GAT CTG GAC ATG TTG 
Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



-80- 

(v) FRAGMEKT TYPE: internal 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asp Ala lieu Asp Asp Phe Asp Leu Asp Met Leu 
IS 10 

10 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v> FRAGMENT TYPE: internal 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

25 Met Pro Lys Arg Pro Arg Pro 

1 S 

(2) INFORMATION FOR SEQ ID NO: 6: 

30 _ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

GAATTCGGGG CCGCGGAGGC TGGATCGGTC CCGGTGTCTT CTATGGAGGT CAAAACAGCG 60 

TGGATGGCGT CTCCAGGCGA TCTGACGGTT CACTAAACGA GCTCTGCTTA TATAGGTCGA 120 

45 

GTTTACCACT CCCTATCAGT GATAGAGAAA AGTGAAAGTC GAGTTTACCA CTCCCTATCA 180 

GTGATAGAGA AAAGTGAAAG TCGAGTTTAC CACTCCCTAT CAGTGATAGA GAAAAGTGAA 240 

50 AGTCGAGTTT ACCACTCCCT ACCAGTGATA GAGAAAAGTG AAAGTCGAGT TTACCACTCC 300 

CTATCAGTGA TAGAGAAAAG TGAAAGTCGA GTTTACCACT CCCTATCAGT GATAGAGAAA 360 

AGTGAAAGTC GAGTTTACCA CTCCCTATCA GTGATAGAGA AAAGTGAAAG TCGAGCTCGG 420 

55 

TACCCGGGTC GAGTAGGCGT GTACGGTGGG AGGCCTATAT AAGCAGAGCT CGTTTAGTGA 480 

ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA AGACACCGGG 540 



20 
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ACCGATCCAG CCTCCGCGGC CCCGAATTC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGATCTGCAG GGTCGCTCGG TGTTCGAGGC CACACGCGTC ACCTTAATAT GCGAAGTGGA 60 

CCGGATCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 120 

ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 180 

AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 240 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 300 

TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA 360 

GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 42 0 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 48 0 
AAGACACCGG GACCGATCCA GCCTCCGCGG CCCCGAATTC 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HuiHcin cytomegalovirus 

(B) STRAIN: K12, Towne 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 382.. 450 



520 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAATTCCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 
55 ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 



60 
120 
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AGaiRAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCXa^G lao 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 240 

TGATAGAGAA AAGTGAAAGT CX3AGTTTACC ACTCCCTATC AGT6ATAGAG AAAAGTGAAA 300 

GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 360 

5 TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 420 

AAGACACCGG GACCGATCCA GCCTCCGCGG 4 50 



(2) INFORMATION FOR SEQ ID NO .-9: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

15 {ii) MOLECULE TYPE: UNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGAICISM: Human cytomegalovirus 

(B) STRAIN: Towne 

20 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 382.. 450 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCCTCG ACCCGGGTAC CGAGCTCGAC TTTCACTTTT CTCTATCACT GATAGGGAGT 60 

GGTAAACTCG ACTTTCACTT TTCTCTATCA CTGATAGGGA GTGGTAAACT CGACTTTCAC 120 

TTTTCTCTAT CACTGATAGG GAGTGGTAAA CTCGACTTTC ACTTTTCTCT ATCACTGATA 180 

GGGAGTGGTA AACTCGACTT TCACTTTTCT CTATCACTGA TAGGGAGTGG TAAACTCGAC 240 

30 TTTCACTTTT CTCTATCACT GATAGGGAGT GGTAAACTCG ACTTTCACTT TTCTCTATCA 300 

CTGATAGGGA GTGGTAAACT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 360 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 42 0 

AAGACACCGG GACCGATCCA GCCTCCGCGG 450 



35 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(ii) MOIiECDLE TYPE: DNA (genomic) 

(vi) ORIGINAL SOTOCE: 
5 (A) ORGANISM: Herpes Simplex Virus 

(B) STRAIN: KOS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAGCTCGACT TTCACTTTTC TCTATCACTG ATAGGGAGTG GTAAACTCGA CTTTCACTTT 

10 TCTCTATCAC TGATAGGGAG TGGTAAACTC GACTTTCACT TTTCTCTATC ACTGATAGGG 

AGTGGTAAAC TCGACTTTCA CTTTTCTCTA TCACTGATAG GGAGTGGTAA ACTCGACTTT 180 
CACTTTTCTC TATCACTGAT AGGGAGTGGT AAACTCGACT TTCACTTTTC TCTATCACTG 24 0 

ATAGGGAGTG GTAAACTCGA CTTTCACTTT TCTCTATCAC TGATAGGGAG TGGTAAACTC 3 00 

GAGATCCGGC GAATTCGAAC ACGCAGATGC AGTCGGGGCG GCGCGGTCCG AGGTCCACTT 360 

15 CGCATATTAA GGTGACGCGT GTGGCCTCGA ACACCGAG 3 98 



60 
120 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 
20 (B> TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ACTTTATCAC TGATAAACAA ACTTATCAGT GATAAAGA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHTiRACTERISTICS : 
30 (A) LENGTH: 38 base pairs 

(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ACTCTATCAT TGATAGAGTT CCCTATCAGT GATAGAGA 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i> Sequence csiaracteristics : 

(A) LENGTH: 38 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : do\jble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGCTTATCAT CGATAAGCTA GTTTATCACA GTTAAATT 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doilble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ACTCTATCAT TGATAGGGAA CTCTATCAAT GATAGGGA 



(2) INFORMATION FOR SEQ ID NO: 15: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AATCTATCAC TGATAGAGTA CCCTATCATC GATAGAGA 



(2) INFORMATION FOR SEQ ID NO: 16: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dovtble 
40 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATG TOT AGA TTA GAT AAA ACT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Ijcu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 



CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC GAG 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 



AAG CTA GGT GTA GAG GAG COT ACA TTG TAT TGG CAT GTA AAA AAT AAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
15 35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

20 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

25 AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 
30 Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
35 115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

40 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 ISO 155 160 

45 CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
50 Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 
He He Cys Gly I^eu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
55 195 200 205 



(2> INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH; 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECDLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

10 Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

15 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
20 65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

25 Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

30 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
35 145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

40 Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

45 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 
Met Ser Arg Leu Asp Lys Ser Lys Val He Asn Ser Ala Leu Glu Leu 



CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 



AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT TiAG 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 



ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 
Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 



AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 
25 Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
30 100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
lis 120 125 

35 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

40 gTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
45 Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
50 180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 
He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 19: 



-88- 



(i) SEQDHNCE CHARACTERISTICS: 

(A) IiEHGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
1 5 10 15 

10 Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

15 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 



Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
20 65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

25 Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

30 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
35 145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

40 Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

45 

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GAG ATG GAA AAA GCG ACA CCG GAG ACQ ATG GTC CAT TGG ATT TGT CTG 
Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp He Cys Leu 
1 5 10 15 

AAG ATG GAG CCA GCT CTG TGG ATG GCC ATT ACA GCA ACA TCG CAC GGC 
Lys Met Glu Pro Ala Leu Trp Met Ala He Thr Ala Thr Ser His Gly 
20 25 30 

GCA AGG CAC AGG ACA TTC GTC GGG TTT TCC GGC TGC CTC CAC CGC AAA 
Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 

TCC CTC ACG TAC CCA GTG ATA TGC CTG AGC AAA CCG AGC CAG AGG ATT 
Ser Leu Thr Tyr Pro Val He Cys Leu Ser Lys Pro Ser Gin Arg He 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECtJLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp He Cys Leu 
15 10 15 

Lys Met Glu Pro Ala Leu Trp Met Ala He Thr Ala Thr Ser His Gly 
20 25 30 

Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 

Ser Leu Thr Tyr Pro Val He Cys Leu Ser Lys Pro Ser Gin Arg He 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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20 



CTG GAC GAC TCG AAG CGC GTA GCC AAG CGG AAG CTG ATC GAG GAG AAC 
Ijeu Asp Asp Ser Lys Arg Val Ala I>ys Arg Lys I<eu lie Glu 6lu Asn. 



CGG GAG CGG CGA CGC AAG GAG GAG ATG ATC AAA TCC CTG CAG CAC CGG 
Arg Glu Arg Arg Arg Lys Glu Glu Met He Lys Ser Leu Gin His Arg 



CCC AGC GCC AGO GCA GAG GAG TGG GAG CTG ATC CAC GTG GTG ACC GAG 
Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu He His Val Val Thr Glu 



GOG CAC CGC AGC ACC AAC GCG CAG GGC AGC CAC TGG AAG CAG AGG AGG 
Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 



AAA TTC CTG CTC GAA GAT ATC GGT CAG TCG CCC ATG GCC TCC ATG CTT 
Lys Phe Leu Leu Glu Asp He Gly Gin Ser Pro Met Ala Ser Met Leu 



GAC GGG GAC AAA GTG GAC CTG GAG GCG TTC AGC GAG TTT ACA AAA ATC 
Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He 



25 ATC ACG CCG GCC ATC ACC CGC GTG GTC GAC TTT GCC AAA AAC CTG CCC 
He Thr Pro Ala He liir Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
100 105 110 

ATG TTC TCG GAG CTG CCG TGC GAG GAT CAG ATC ATC CTG CTG AAG GGC 
30 Met Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly 
115 120 " 125 

TGC TGC ATG GAG ATC ATG TCG CTG CGC GCC GCC GTG CGC TAC GAC CCC 
Cys Cys Met Glu He Met Ser liCu Arg Ala Ala Val Arg Tyr Asp Pro 
35 130 135 140 

GAG AGC GAA ACG CTG ACG CTG AGC GGG GAA ATG GCC GTC AAA CGC GAG 
Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 150 155 160 

40 

CAG TTG AAG AAC GGA GGG CTG GGG GTC GTG TCT GAT GCC ATC TTC GAC 
Gin Leu Lys. Asn Gly Gly Leu Gly Val Val Ser Asp Ala He Phe Asp 
165 170 175 

45 CTC GGC AAG TCG CTG TCT GCC TTC AAC CTG GAC GAC ACC GAG GTG GCC 
Leu Gly Lys Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala 
180 185 190 

CTG CTG CAG GCC GTG CTG CTC ATG TCC TCA GAC CGG ACG GGG CTG ATC 
50 Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp Arg Thr Gly Leu He 
195 200 205 

TGC GTG GAT AAG ATA GAG AAG TGC CAG GAG TCG TAC CTG CTG GCG TTC 
Cys Val Asp Lys He Glu Lys Cys Gin Glu Ser Tyr Leu Leu Ala Phe 
55 210 215 220 

GAG CAC TAC ATC AAC TAC CGC AAA CAC AAC ATT CCC CAC TTC TGG TCC 
Glu His Tyr He Asn Tyr Arg Lys His Asn He Pro His Phe Trp Ser 
225 230 235 240 
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AAG CTG CTG ATG AAG GTG GCG GAC CTG CX3C ATG ATC GGC GCC TAG CAC 768 
Lys Leu Leu Met Lys Val Ala Asp Leu Arg Met He Gly Ala Tyr His 
245 250 255 

5 

GCC AGC CGC TTC CTG CAC ATG AAG GTG GAG TGC CCC ACC GAG CTC TCC 816 
Ala Ser Arg Phe Leu His Met Lys Val Glu Cys Pro Thr Glu Leu Ser 
260 265 270 

10 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu lie Glu Glu Asn 
20 1 5 10 15 

Arg Glu Arg Arg Arg Lys Glu Glu Met lie Lys Ser Leu Gin His Arg 
20 25 30 

25 Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu He His Val Val Thr Glu 
35 40 45 

Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin T^g Arg 
50 55 60 

30 

Lys Phe Leu Leu Glu Asp He Gly Gin Ser Pro Met Ala Ser Met Leu 
65 70 75 80 

Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He 
35 85 90 95 

He Thr Pro Ala He Thr Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
. 100 105 110 

40 Met Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly 
115 120 125 

Cys Cys Met Glu He Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro 
130 135 140 

45 

Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 150 155 160 

Gin Leu Lys Asn Gly Gly Leu Gly Val Val Ser Asp Ala He Phe Asp 
50 165 170 175 

Leu Gly Lys Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala 
180 185 190 

55 Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp Arg Thr Gly Leu He 
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5 



25 



45 



195 200 205 

Cys Val Asp Lys lie Glu Lys Cys Gin Glu Ser Tyr Leu Leu Ala Phe 
210 215 220 

Glu His Tyr lie Asn Tyr Arg Lys His Asn He Pro His Phe Trp Ser 
225 230 235 240 



Lys Leu Leu Met Lys Val Ala Asp Leu Arg Met He Gly Ala Tyr His 
10 245 250 255 

Ala Ser Arg Phe Leu His Met Lys Val Glu Cys Pro Thr Glu Leu Ser 
260 265 270 

15 (2) INFORMATION FOR SEQ ID NO: 24: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24. 
TCCCCGGGTA ACTAAGTAAG GATCC 2 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: , nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AGTGGGTCCC CGGGTGACAT GGAA 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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Ser Gly Ser Pro Gly Asp Met Glu 



(2) INFORMATION FOR SEQ ID NO; 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 
AGTGGGTCCC CGGGTCTGGA CGAC 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
Ser Gly Ser Pro Gly Leu Asp Asp 



